ARITHMETIC PROGRESSIONS OF PRIMES IN SHORT INTERVALS 



CHUNLEI LIU 

Abstract. Green and Tao proved that the primes contains arbitrarily long arithmetic 
progressions. We show that, essentially the same proof leads to the following result: 
If N is sufficiently large and M is not too small compared with N, then the primes in 
the interval [N, N + M] contains many arithmetic progressions of length k. 



1. Introduction 

Let be a positive integer going to infinity. We write o(l) for any quantity wliicli 
tends to zero as N goes to infinity, and write 0(1) for any quantity wliicli lias a bound 
independent of N. Let w = w{N) ^ | log log be any function which tends to infinity 
with N, and let W := Hp^tuP be the product of the primes up to w. Let A be the 
VT-tricked von Mangoldt function defined by 

X r \og{Wn + 1) when Wn + 1 is prime 

[ otherwise. 

Let M be a large prime number. Define Z^/ := Z/MZ to be the finite field consisting 

of residue classes modulo M. We always identify Zj\/ with the set 

{A^ - M, - M + 1, ■ ■ ■ , - 1}, 
which is a complete system of representatives modulo M. 

If A is a finite non-empty set and / : A — > M is a function, we write 

E{f):=E{f{x)\xeA) 
for the average value of /, that is to say 



Here, as is usual, we write 1^41 for the cardinality of the set A. More generally, if P{x) 
is any statement concerning an element of A which is true for at least one x G A, we 
define 



E(/(x)|P(x)) : = 



\{xeA: P{x)}\' 
Let k be any fixed integer greater than 3, and let '■= l/2'^{k + 
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A famous theorem of Green- Tao in [3] asserts that the prime numbers contain arbitrarily 
long arithmetic progressions. In this paper we show that the proof of Green- Tao really 
yields the following theorem. 

Theorem 1.1. Let M be a function of N with values in the set of prime numbers 
which satisfies A^^ < M ^ N for some positive numbers. Suppose that on the interval 
[N + efcM, + 2efcM] the mean value of the W -tricked von Mongoldt function tends to 
1 as N goes to infinity. Define the function f on Zj\/ by setting 

k-^2'^-^K{n) when e^M ^n-N 2ekM 
otherwise. 



fin) 



Then there is a positive constant depending only on k such that 

E{f{x)f{x + r)...f{x + {k- l)r) \x,re Zm) ^ c,, - o(l). 

From that theorem we see that, for sufficiently large A^, there are at least fe^M^/ log'^ A^ 
arithmetic progressions of length k consisting of primes in the interval {WN, W{N+My\, 
where hk is a positive constant Ck depending only on k. According to Green- Tao, we 
can in fact take w to be a sufficiently large number independent of A^, depending only 
on k. Then W will be a constant depending only on k. 

AcknowledgementsThe author would like to thank the Morningside Center of Mathe- 
matics, Chinese Academy of Sciences for support over several years. 

2. The linear forms property 

In this section we construct a majorant u for / and prove that u satisfies the linear 
forms condition. 

Definition 2.1. Let i? be a parameter (in applications it will be a small power of A^). 
Define 

Anin) := \og{R/d) = \og{R/d)+. 

d\n d\n 
d^R 

These truncated divisor sums have been studied in several papers, most notably the 
works of Goldston and Yildirim [H El [3] concerning the problem of finding small gaps 
between primes. 

Definition 2.2. Let R := M^'^'^'''^*. We define the function v : Zm ^ M+ by 

I 1 otherwise 
for all A^ + M ^ n < A^ + M. 

Lemma 2.3. Let N be a sufficiently large integer depending on k. Then the function 
V is majorant for f in Theorem That is, v{n) ^ for all n G T^m, o,nd v{n) ^ 

k-^2-^-^k{n) for all N + e^M ^n^N + 2ekM . 

Proof. The first claim is trivial. The second claim is also trivial unless Wn -|- 1 is 
prime. From definition of i?, we see that Wn -|- 1 > i? if A^ is sufficiently large. Then 
the sum over (i|iyn -|- 1, ^ in (12.11) in fact consists of just the one term d = 1. 
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Therefore kR{Wn + 1) = logi?, which means that z/(n) = ^logi? ^ k-^2-^-^1{n) 
by construction of R and A^. □ 

Definition 2.4 (Linear forms condition). Let mQ,tQ and Lq be small positive integer 
parameters. Then we say that u : satisfies the (mo, to^ -^o)-liiiear forms 

condition if the following holds. Let m ^ mo and t ^ to be arbitrary, and suppose 
that {Lij)i^i^rn,i^^t are arbitrary rational numbers with numerator and denominator 
at most Lo in absolute value, and that fej, 1 ^ i ^ m, are arbitrary elements of Zm- 
For 1 ^ i ^ m, let V'i : ~^ the linear forms V'i(x) = Yl]=i ^ij^j + where 

X = (xi, . . . ,xt) G Z^, and where the rational numbers Lij are interpreted as elements 
of in the usual manner (assuming M is prime and larger than Lq). Suppose that 
as i ranges over 1, . . . , m, the t-tuples {Lij)i^j^t G Q* are non-zero, and no t-tuple is a 
rational multiple of any other. Then we have 

E (z/(^i(x)) . . . ^/(V'„^(x)) I X G Z*,) = 1 + (1). (2.1) 

Note that the rate of decay in the o(l) term is assumed to be uniform in the choice of 

The following propositions plays a crucial role in proving that u satisfies the linear forms 
condition. 

Proposition 2.5 (Goldston-Yildirim). Let m,t be positive integers. For each 1 ^ i ^ 
m, let := Xlj=i -^ij^i + bi, be linear forms with integer coefficients Lij such that 

\Lij\ ^ ^Jw{N)/2 for all i = 1, . . . m and j = 1, . . . ,t. We assume that the t-tuples 
{LijYj^i are never identically zero, and that no two t-tuples are rational multiples of each 
other. Write 6i := Wipi + 1. Suppose that B is a product JlLi C M* oft intervals f, 
each of which having length at least R^^^ . Then 

'WlogR'' '" 



E(Ak(^^i(x))^ . . . A^(^^„(x))^|x eB) = {l + o„,,i(l)) 



Remarks. That proposition was stated and proved by Green- Tao in |3], however, ac- 
cording to Green- Tao, it is a straightforward generalisation of P, Proposition 2]. 

Before proving the linear forms condition. We show that E(i/) = 1 -|- o(l). 
Lemma 2.6. We have E(i/) = 1 + o(l). 

Proof. Apply Proposition 12.51 with m := t := 1, ^pl{xl) := Xi and B := [N + e^M, A^ -|- 
2ekM] (taking A^ sufficiently large depending on k, of course). Comparing with Defini- 
tion [22] we thus have 

E(z/(x) I X G [A^ + ekM, N + 2efcM]) = 1 + o(l). 

But from the same definition we clearly have 

E(z/(x) I X G ZA/\[Ar + ekM, N + 2ekM]) = 1; 

Combining these two results confirms the lemma. □ 

Now we verify the linear forms condition, which is proven in a similar spirit to the above 
lemma. 
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Proposition 2.7. The function v satisfies the 3/;; — 4, k) -linear forms condition. 

Proof. Let ipi{x) = Yl]=i LijXj+bi be linear forms of the type which feature in Definition 
12.41 That is to say, we have m ^ k ■ 2^~^, t ^ 3k — 4, the Lij are rational numbers 
with numerator and denominator at most k in absolute value, and none of the t-tuples 
(-^ii)j=i is zero or is equal to a rational multiple of any other. We wish to show that 

E{iy{M^)) • • • HM^)) I X G ) = 1 + 0(1). (2.2) 

We may clear denominators and assume that all the Lij are integers, at the expense of 
increasing the bound on L^j to \Lij\ ^ (A; + 1)!. Since w{N) is growing to infinity in 
N, we may assume that (fc + 1)! < \Jw{N)/2 by taking sufficiently large. This is 
required in order to apply Proposition 12.51 as we have stated it. 

The two-piece definition of v in Definition 12.21 means that we cannot apply Proposition 
12.51 immediately, and we need the following localization argument. 

We chop the range of summation in (12. 2p into almost equal-sized boxes, where 
Q = Q{N) is a slowly growing function of to be chosen later. Thus let 

Bu„...,u, = {x G Z^^, : X, e[N+ [u,M/Q\ , N + [{u, + l)M/gj ), j = 1, . . . , t}, 

where the Uj are to be considered (modQ). Observe that up to negligible multiplicative 
errors of 1 + o(l) (arising because the boxes do not quite have equal sizes) the left-hand 
side of (12. 2p can be rewritten as 

E(E(z/(^i(x)) . . . z/(?/'m(x))|x G 5„^,...,«J|mi, . . . ,Mt G Zq). 

Call a t-tuple (mi, . . . ,Ut) G Zg nice if for every 1 ^ i ^ m, the sets 4'i{Bui^...,ut) are 
either completely contained in the interval [A^ -|- e^M, + 2ekM] or are completely 
disjoint from this interval. From Proposition 12.51 and Definition 12.21 we observe that 

E(i/(V'i(x))...KV'm(x))|xG5 ) = 1 + Om,t{^) 

whenever {ui, . . . ,Ut) is nice, since we can replace each of the z/('?/'j(x)) factors by either 
p^^^^A^(^j(x)) or 1, and M/Q will exceed R^^"'- for Q sufficiently slowly growing in A^, 
by definition of R and the upper bound on m. When (mi, . . . ,ut) is not nice, then we 
can crudely bound z/ by 1 + p^^^^A|,(6'j(x)), multiply out, and apply Proposition 12.51 
again to obtain 

E(z/(7/;i(x))...z/(^^(x))|xG S 

We shall shortly show that the proportion of non-nice t-tuples (mi, . . . ,Ut) in Zg is at 
most Om,i(l/Q), and thus the left-hand side of (12.21) is l + Om,t(l) +0m,t(l/<5), and the 
claim follows by choosing Q sufficiently slowly growing in A^. 

It remains to verify the claim about the proportion of non-nice t-tuples. Suppose 
(mi, . . . ,Ut) is not nice. Then there exists 1 ^ i ^ m and x, x' G Bui,...,ut such that 
ipiix) lies in the interval [A^ + e^M, A^ + 2efcM], but ipii'x.') does not. But from definition 
of Bui,...,ut{^^d the boundedness of the Lij) we have 

t 

^,(x),V'i(xO = J]L,,(Ar+ [Muj/Q\)+h + Om,t{M/Q). 
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Thus we must have 

t 

N + ae,M = J] U^{N + [Mu,/Q\) +h + 0^,t{M/Q) 

for either a = 1 or a = 2. Dividing by M/Q, we obtain 
t t 
^LijUj = (1 ~ bi)Q/^ + (^^kQ + (modQ). 

i=i i=i 

Since {L^Yj^^ is non-zero, the number of t-tuples {ui, . . . ,Ut) which satisfy this equation 
is at most Om,t(Q*~^)- Letting a and i vary we thus see that the proportion of non-nice 
t-tuples is at most Om,t{^/Q) as desired (the m and t dependence is irrelevant since 
both are functions of k). □ 



3. The correlation property 
In this section we show that u satisfies the correlation condition. 

Definition 3.1 (Correlation condition). Let mo be a positive integer parameter. We 
say that u : "Zm satisfies the mo- correlation condition if for every 1 < m ^ mo 

there exists a weight function r = '■ which obeys the moment conditions 

E(r'?) = 0^,,(1) (3.1) 

for all 1 ^ g < oo and such that 

E(z/(x + /ii)z/(x + /i2) . . .z^(a; + /i„) I X G Za/) ^ ^ T{hi - hj) (3.2) 

for all hi, . . . , hm & (not necessarily distinct). 

The following proposition plays a crucial roles in proving that v satisfies the correlation 
condition. 

Proposition 3.2 (Goldston-Yildirim). Letm 1 be an integer, and let B be an interval 
of length at least R^^"^. Suppose that hi, . . . ,hm are distinct integers satisfying \hi\ ^ N"^ 
for all 1 ^ i ^ m, and let A denote the integer 

Then 

E{An{W{xi + hi) + AR{W{xm + h^) + iflx G B) 
Here and in the sequel, p is always understood to be prime. 

Remarks. That proposition was stated and proved by Green- Tao in [Ij, however, Green- 
Tao attributed it to Goldston-Yildirim for reasons similar to Proposition 12.51 

In a short while we will use Proposition 13.21 to show that u satisfies the correlation 
condition. Prior to that, however, we must look at the average size of the "arithmetic" 
factor np|A(^ + Om{p~^^'^)) appearing in that proposition. 
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Lemma 3.3. Let m ^ 1 be a parameter. There is a weight function r = : Z ^ 
such that T{n) ^ 1 for all n 0, and such that for all distinct hi, . . . ,hj G [A^ + 
efcM, N + 2efcM] we have 

p|A l^i<j<m 

where A is defined in Proposition \3.S\ and such that E(r''(n)|0 < \n\ ^ M) = Om,<j(l) 
for all < q < CO . 



Proof. We observe that 



0™(1) 



p|A l<j<j^m ^ p\hi-hj 

By the arithmetic mean-geometric mean inequahty (absorbing all constants into the 
0^(1) factor) we can thus take r^(n) := 0^(1) Up\ni^ +p-V2)0„(i) ^jj ^ _^ q_ (^rj.^^ 

value of r at is irrelevant for this lemma since we are taking all the hi to be distinct). 
To prove the claim, it thus suffices to show that 



p\n 



-l/2\Om(q) 



< |n| ^ M = OrnM for all < g < oo. 



Since {1 + p i/^)*^™.!?) is bounded by 1 + p for all but Om,g(l) many primes p, we 
have 



E('j](l+p-i/2)«-(«) 0<|r2KM') ^0^,,(l)E('j](l+p-i/^) 

p\n p\n 



0<n^M 



But Upini^ + P-'^') < Y.d\nd-"\ and hence 



p\n 



Q<\n\^M] <: 0^„,,(1)^ E^"' 

l^|n.|^Af d|n 
M 



/4 



(i=l 



which is Om,q{^) as desired. 



□ 



We are now ready to verify the correlation condition. 

Proposition 3.4. The measure v satisfies the 2^^'^ -correlation condition. 

Proof. Let us begin by recalling what it is we wish to prove. For any 1 ^ m ^ 2^~^ and 
hi, . . . , hm & we must show a bound 

E(z/(a; + /ii)z/(a; + /la) . . + /im) I a; G Zat) ^ ^ rihi-hj), (3.4) 
where the weight function r = is bounded in for all q. 

Fix m, hi, ... , hm- We shall take the weight function constructed in Lemma [3.31 (iden- 
tifying Tjm with the integers between — M/2 and +M/2), and set 

r(0) := exp(Cmlog A^/loglog A^) 
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for some large absolute constant C. From the previous lemma we see that E(r'^) = 
0^,5(1) for all q, since the addition of the weight r(0) at only contributes 0^,^(1) at 
most. 

We first dispose of the easy case when at least two of the hi are equal. In this case 
we bound the left-hand side of fl2.2p crudely by But from Definitions 12.11 12.21 

and by standard estimates for the maximal order of the divisor function d{n) we have 
the crude bound ^ exp{C log N / log log A^), and the claim follows thanks to our 

choice of t(0). 

Suppose then that the hi are distinct. Write 

, , (j){W) AliWn + 1) 

~W log^R l[iV+efcAf,7V+2efeM]W- 

Then by construction of u (Definition 12.21) . we have 

E(z/(a; + hi) . . . u{x + hm) \ x E Zm) 

^E{{l + g{x + hi))...{l + g{x + hm)) \ xEZm). 
The right-hand side may be rewritten as 



J2 ^(llg{x + h, 

AC{l,...,m} ^i€A 



xeZ 



M 



Observe that for i,jEA we may assume \hi — hj\ ^ e^M, since the expectation vanishes 
otherwise. By Proposition 13.21 and Lemma 13.31 we therefore have 



E(llg{x + hi) xeZmJ^ Y1 



/l,)+Om(l). 

<j<:n 



Summing over all A, and adjusting the weights r by a bounded factor (depending only 
on m and hence on A;), we obtain the result. □ 



4. Proof of the main theorem 
In this section we conclude the proof of Theorem II. 1[ 

Definition 4.1. Let v : Zm be a function. We say that u is A;-pseudorandom 

measure if it obeys the estimate E(z/) = l+o(l) and satisfies the {k-2^'~^, 3k— A, A;)-linear 
forms condition as well as the 2''~^-correlation condition. 

Theorem 4.2 (Green- Tao). The function v : Zm in Definition \2.S\ is a k- 

pseudorandom measure that majorises f in Theorem \l.l\ 

Proof. That theorem follows from Lemmas 12.31 12.61 and Propositions 12.71 13. 4[ □ 

Remarks. I have attributed this theorem to Green and Tao, because the above argument 
is a straightforward generalisation of that of Proposition 9.1]. 

The proof of Theorem 11.11 is base on the following theorem. 
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Theorem 4.3 (Green- Tao). Let k ^ 3 and < 6 ^ 1 be fixed parameters. Suppose 
that V : — k -pseudorandom measure. Let f : IR^ be any non-negative 

function obeying the bound 

^ f{x) ^ u{x) for all x e Zm (4.1) 

and 

E(/) ^ 5. (4.2) 

r/ien we have 

E{f{x)f{x + r)...f{x+{k- l)r) \x, r G Zm) ^ c{k, 6) - Ok,s{l) (4.3) 
where c{k, 6) > stands for a constant depending only on k and 6. 

That theorem is a great generahzation of the foUowing theorem. 

Theorem 4.4 (Szemeredi's theorem). Let k ^ 3 and < 6 ^ 1 be fixed parameters. Let 

f : Tjm be any function which is bounded by a bound independent of M. Suppose 
that 

E(/) ^ 5. (4.4) 

Then we have 

E{f{x)f{x + r)...f{x + ik- l)r) |x, r G Z^) ^ c(A:, 5) - o,,5(l) (4.5) 

where c{k,6) > is the same constant which appears in Theorem \4.3[ {The decay rate 
Ofc,5(l), on the other hand, decays significantly faster than that in Theorem \4.3^ . 

Remarks. The k = 3 case of Szemeredi's theorem was estabhshed by Roth[5]. The 
general weU as the k = 4 case was proved by Szemeredi [HI [7] . The formulation 

here is different from the original one, but can be deduce from the original one. The 
argument was first worked out by Varnavides [9J ) . A direct proof of Theorem 14.41 can 
be found in [8] . 

Proof of Theorem By our assumption on M, we see that 

E(/) = ^^^^ Yl A(n) = fc-i2-'=-^e,(l + o(l)). 

We now apply Theorem 14.21 and Theorem 14.31 to conclude that 

E(/(x)/(a; + r) ...f{x + {k- l)r) | x, r G Zat) ^ c{k, k-^2~^-^ek) - o(l). 
Theorem 11.11 follows by setting = c{k, k~^2~'^~^ek). □ 
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